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" HOW LIKELY IS AN I.I.D. DEGREE SEQUENCE TO BE 

O ■ GRAPHICAL? 

o 

By Richard Arratia and Thomas M. Liggett 1 

U 

Q_i| University of Southern California and University of California, Los 

■ Angeles 

Given i.i.d. positive integer valued random variables Di, . . . ,Dn, 
one can ask whether there is a simple graph on n vertices so that 
the degrees of the vertices are D\, . . . ,D n . We give sufficient condi- 
tions on the distribution of Di for the probability that this be the 
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case to be asymptotically 0, § or strictly between and |. These 



' conditions roughly correspond to whether the limit of nP(Di > n) is 

\ infinite, zero or strictly positive and finite. This paper is motivated by 

the problem of modeling large communications networks by random 
graphs. 

1. Introduction and statement of results. The growth of the Internet 
| and other large communications systems over the past few decades has led 

0^ ■ to a significant amount of interest in modeling such systems as random 

\ graphs with a large number of vertices. Many books and papers in both 

' mathematics and applied areas such as computer science have resulted from 

I/"") . this interest. 

Classical mathematical work on random graphs concentrated on the so- 
called Erdos-Renyi graphs. In these graphs, one takes n vertices and con- 
nects pairs of vertices by edges independently with probability p for each 
pair. While these random graphs have a rich mathematical theory [see Bol- 
lobas (2001) or Janson, Luczak and Rucinski (2000), e.g.], it turns out that 
they are not good choices for modeling many complex networks. To illustrate 
^ ■ their limitations, consider the distribution of the degree of a fixed vertex, 

which in Erdos-Renyi graphs is Binomial with parameters n — 1 and p. If 
one takes n large and p small in the usual way, this distribution is approxi- 
mately Poisson. The Poisson distribution has very rapidly decreasing tails. 
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Networks that appear in practice, on the other hand, tend to have degree 
distributions with very large tails, often lacking a third, or even second, mo- 
ment. This paper is a contribution to the problem of constructing random 
graph models in which the distribution of degrees can be prescribed by the 
modeler. 

Many authors have observed that in graphs of complex networks such as 
the Internet, the degree sequence appears to fit a power law of the form 

P(D > k) ~ ck 1 ' 13 . 

One way to mimic the randomness of Erdos-Renyi graphs, while maintaining 
a power law degree distribution, is to fix a law, sample n times from this law 
to get a (potential) degree sequence, and then, if there are graphs with this 
degree sequence, choose a graph uniformly from all graphs with the given 
degree sequence. Albert and Barabasi (2002) provided a good survey of such 
complex networks and models such as that above. To avoid the uncertainty 
of sampling, Aiello, Chung and Lu (2001) took a power law and formed a 
degree sequence by taking di (for 1 < i < n) to be the (ceiling of the) (i/n)th 
quantile of that law. A survey of rigorous results on graphs with power law 
degree distributions appeared in Bollobas and Riordan (2003). 

To implement the procedure described in the preceding paragraph, two 
problems must be solved: 

1. Determine which degree distributions have the property that if the de- 
grees are chosen by sampling from that distribution, then there is sub- 
stantial probability that there exists a simple graph with those degrees if 
n is large. 

2. Given that there is such a graph, find efficient algorithms that will make 
a uniform choice from among these graphs. 

This paper is devoted to the first problem. The second problem was 
treated in several papers: Jerrum and Sinclair (1990), Jerrum, McKay and 
Sinclair (1992) and Steger and Wormald (1999). Note that while the titles 
of these papers often suggest that they treat the problem only for regular 
graphs (in which the degrees are all the same) , the algorithms proposed usu- 
ally apply to nonregular graphs as well, provided that the degrees do not 
vary too much. In fact, conditions on the degree sequence are given which 
imply that the algorithm works well, and then it is observed that these 
conditions are satisfied by regular graphs. 

We now turn to the mathematical formulation of our problem and to 
the statement of our main result. Consider a sequence of positive integers 
d\,...,d n . The decreasing rearrangement of this sequence is mi, . . . ,m n , 
where {mi, . . . ,m n } = {di, . . . ,d n } as multisets and mi > ni2 > ■•■ > m n . 
Erdos and Gallai (1960) showed that there exists a simple graph (i.e., one 
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with no loops or multiple edges) with n vertices whose respective degrees 
are d\,...,d n if and only if Y^i <k is even and their decreasing rearrangement 
satisfies 

j n 

(1.1) ^2m<j(j-i)+ min (i> m i)) l<j<n- 

i=l i=j+l 

Sequences with these properties are called graphical. Some equivalent condi- 
tions for a sequence to be graphical can be found in Sierksma and Hoogeveen 
(1991). 

The necessity of (1.1) is quite easy to check. The left-hand side of (1.1) 
is the total degree of the first j vertices. The first term on the right-hand 
side is the maximum total degree of these vertices coming from edges that 
connect vertices in {1, . . . , j} to other vertices in this set. The second term is 
a bound on the total degree of the vertices in {1, . . . , j} coming from edges 
that connect to vertices in {j + 1, . . . , n}. The proof of the sufficiency of (1.1) 
is more difficult. Proofs can be found in several papers, including Sierksma 
and Hoogeveen (1991) and Choudum (1986). The latter paper provides a 
simple inductive construction of the required graph. Briefly, suppose 

mi = • • • = m,j > nij + i > • ■ • > m n > 1 

satisfies (1.1). While not obvious, it can be checked that the sequence of 
degrees 

mi, . . .,m,j-i,mj - l,m J+ i, . . . ,m„_i,m n - 1 

also satisfies (1.1) (and has smaller total degree). A graph with the origi- 
nal degree sequence can be constructed from one with the modified degree 
sequence either by adding one edge or by deleting two edges and adding 
three edges, according to whether the edge (j,n) is already in the graph 
with smaller total degree. 

To state our main result, let D be a positive integer valued random vari- 
able and let Di, . . . ,D n be an i.i.d. sequence of random variables with the 
distribution of D. Let A be the set of limit points of the sequence of proba- 
bilities 

P((Di, . . .,D n ) is graphical). 

If < P(D is even ) < 1, then the parity restriction forces A C [0, |], as we 
see below. We are interested in finding sufficient conditions for A = {0}, A = 
{i}, or AC (0,i). Our main result is given below. To simplify its statement, 
we consider only the case in which D takes both even and odd values with 
positive probability. 

Theorem. Suppose thatO < P(D is even) < 1. Then the following state- 
ments hold: 
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(a) 7/lim n _ 

(b) //lim n _ 



,nP(D >n) = oo, then A = {0}. 

, nP{D >n) = c, where < c < oo, and 



(1.2) 

then A = {0}. 
(c) I/lim n 

(1.3) 

and 
(1.4) 



then A C (0, \). 

(d) // ED < oo or if 

(1.5) 

then A = 



E 



71 



P(D > n) 



oo, 



j nP(D > n) = c, where < c < oo , 
< lim inf r?P(D = n) < lim sup n 2 P(D = n) < oo 



E 



--P(-D>n) 

71 



< OO, 



sup n(log n)P{D > n) < oo, 



Note that it follows from parts (b) and (c) of the Theorem that A is not 
monotone in the distribution of D. The main interest in part (d) is the case 
in which ED < oo. We include the result when (1.5) holds primarily because 
this shows that ED < oo is not necessary and sufficient for A = { i } . 

Another version of this problem was considered earlier. To describe it, 
recall that a partition of a positive integer N is a sequence of integers 71 > 
72 > • • • > In > so that 

n 

1=1 

In 1982, Wilf conjectured that if Tn is chosen at random uniformly from all 
partitions of N, then 

lim P(Tn is graphical) = 0. 
This conjecture was proved by Pittel (1999). 



2. Proof of the Theorem. This section is devoted to the proof of the 
Theorem. The order in which the four parts are stated and proved is ac- 
cording to the size of the tails of the distribution of D — from long tails to 
short. However, the proof of part (c) is much longer than the proofs of the 
other parts, so you might want to read the proofs in the order (a), (b), (d) 
and (c). 
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Before entering into the main part of the proof, we deal with the easy 
parity issue. If P(D is even) = 1, then clearly P{D\ + • • • + D n is even) = 1 
as well. Similarly, if P{D is odd) = 1, then P(D\ + • • • + D n is even) = or 
1 according to whether n is odd or even. 

Proposition. I/O < P(D is even) < 1, then 

lim P(D\ + • • • + D n is even) = h. 

Proof. The result follows immediately from 

2P(£>i + • • • + D n is even) - 1 = £((-l) Dl+ - +D ") = [E{-l) D ] n 
and the fact that \E(-l) D \ < 1. □ 

We turn now to the proof of the Theorem, for which it is useful to re- 
state (1.1) in a more convenient form. Let M n (l) > M n (2) > ■•■ > M n (n) 
be the decreasing rearrangement of D±, . . . , D n . Then the inequality in (1.1) 
becomes 

3 n 
(2-1) Y / M n (i)<j(j-l)+ J2 min(i,M n «), 

i=l *=i+i 

which can be written as 

3 

(2.2) £ Mn (0<j(j-l)+ Yl HM n (i)>i}i 

i=l t<l<3<i<n 

where 1a denotes the indicator of the set A. 

Proof of part (a). Considering only the case j = 1 in (2.2), we see 
that 

P((D 1 , . . . ,D n ) is graphical) < P(M n (l) < n - 1) = [1 - P(D > n)] n , 
which tends to as n — > oo by assumption. □ 

The proofs of parts (b)-(d) of the Theorem are based on a useful repre- 
sentation of M n (l), . . . , M n (n). To describe it, let Xi^X^i ■ ■ ■ be independent 
unit exponentials, and let g:[0,oo)— ►{1,2,...} be a right continuous, in- 
creasing function. Take Di = g(l/Xi), so that 

(2.3) P(D > n) = 1 - exp 



9 1 {n), 

where <? _1 (n) is defined by g(u) > n iff u > g~ l (n). Note that the distribution 
of Di uniquely determines the values g~ l {n), and hence uniquely determines 



G 



R. ARRATIA AND T. M. LIGGETT 



the function g, since g is taken to be increasing and right continuous. This 
construction of the D^s is a variant of the familiar quantile construction 
of an arbitrary random variable X as X = F~ l (U) in terms of a random 
variable U that is uniform on [0,1]. 

For fixed n, let X^ , . . . , Xr n -\ be the increasing rearrangement of X\ , . . . , X n . 
Then 

M n (i)=g(^~y l<i<n. 
Elementary properties of the exponential distribution imply that 



(X {1 ), . . . ,X (n) ) = ( 



d ( X\^ Xi ^ X2 X\ ^ X2 x n 
n n n — 1 n n — 1 1 



where = denotes equality in distribution. [This useful representation was 
first observed and exploited by Epstein and Sobel (1953) and Renyi (1953).] 
It follows from this that 

(M n (l),M n (2),...,M n (n)) 

*(•((*)>((* + £)>■■. 

ffX 1 X 2 X,, ^ 

g[ — + — - + ••• + — 

V \ n n — 1 1 
As usual, we let S n = X\ + • • • + X n denote the partial sums of the AVs. 

Proof of part (b). Since nP(D > n) — ► c, (2.3) implies that 5~ 1 (n) 
n/c and hence that 

(2.5) lim iQ = c 

x— >oo x 

By (2.4) and (2.5), 

'M n (l) M n (j)\ d / c 



(2.6) 



n n J \b\ bj 



, . . . , 



for j > 1, where — ► denotes convergence in distribution. Statements of this 
type are well known, of course; see, for example, Section 2.3 of Leadbetter, 
Lindgren and Rootzen (1983). However, especially in the proof of part (c) of 
the Theorem, we need much more detailed information about the asymptotic 
behavior of (M n (l), . . . , M n {n)) than such results provide. 
By the strong law of large numbers, 

2 n 1 n 

Ji™- E HM n (i)>i } = -£i{M„(o>i} 

i=j+l i=l 
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1 n 

= lim - J]t {D . >l} = P(D>l) a.s. 

for each j. Dividing (2.2) by n and passing to the limit as n — > oo, it follows 
that 

limsupP((Di, . ..,D n ) is graphical) < P [ ^ < ^ P(L> > i) for all j > 1 j . 

n-+oc Vi=l 6i i=l / 



So, it suffices to show that 

f)(-|-P(D>i))=oo a.s. 
This follows from (1.2), since 



which converges absolutely a.s. by the law of the iterated logarithm applied 
to the sequence S±,S2, □ 

Proof of part (c). We begin by observing that (1.3) implies that there 
is a constant C so that 

(2.7) \g{u)-g{v)\<C\u-v\ + l 

for all u,v > 0. To see this, use (1.3) and (2.3) to conclude that there is an 
e > and an N so that 

g-\n + l)-g-\n)>e 

for all n> N. Now take < u < v and let m = g(u),n = g(v). Without loss 
of generality, we may assume N < m < n since (2.7) is automatic if n = m 
and once we know (2.7) holds for large n, v, it can be made to hold for other 
u,v by increasing the value of C. In this case, 

g~ l (m) < u < i g~ 1 (m + 1) and g~ l (n) < v < g~ 1 (n + 1), 

so that 

v — u> g~ 1 (n) — (7 _1 (m + 1) > e(n — m — 1). 

This gives (2.7) with C = e~ 1 . 

In what follows, we use O(l) in distributional statements to represent 
possibly random quantities that are bounded by a universal constant. In 

limiting statements of the form Y n — > Y + 0(1), the meaning is that the 
sequence of distributions of Y\ , Y2, . . ■ is tight and that all weak limits of the 
distribution of Y„ lie between the distribution of Y± a universal constant. 
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Since the proof of part (c) is quite long, we break it up into several lemmas. 
The first determines the asymptotics of the distribution of the left-hand side 
of (2.1), when centered appropriately. The second lemma carries out the 
additional analysis that is needed to deal with the right-hand side of (2.1). 
The final lemma provides bounds for the nonrandom quantities that arise 
in the centering in the first two lemmas. □ 



LEMMA 1. For every N > I, 

J^fM n {i) 1 



-In 



max > I 

N<j<n^ ] 



. 1=1 



n-l + l 



i=l 



1 1 



Proof. In the proof, we need to treat the summands above separately 
in three regimes: i<§,§<i<« — n e and n — n e < i < n. We begin with 
some estimates that are needed in all three cases. Define 



M n (i) 1 | 
ZnW = g\ 

n n ' 



v — - — 



for 1 < i < n. Then jointly in i, 



n\Z n (i)\ 



(21 



-In 



9 £ 



. 1=1 



n-l+l 



E 



< c 



E 

i=i 



n-l + l 



\ i=i 



n-l + l 



l£ 



J=1 n-l + lf^n-l + l t 



+ 0(1) 



by (2.4) and (2.7). To bound the right-hand side of (2.8), sum by parts to 
get 

^ Xj-l *(Si-l)- -1 + 1) 
^n-l + l 



l=i 



l=i 

Si -i 
n — i + 



n-l + l 



i-l 



i-£ 



Si -I 



^(n-l^n-l + l) 
where again Sk = X\ + ■ ■ • + X^. Therefore, letting 

\S k - k\ 



S = sup 



fc>l y/MogJW+l) 
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which is finite a.s. by the law of the iterated logarithm, we see that for i < n, 

Xi-1 



(2.9) 



l=1 n-l + l 



n — i + 1 



Combining (2.8) and (2.9) gives 



n\Z n (i)\ < 2CSVilog(i + l 
(2.10) 



.1=1"-* + - / l= i 

provided that the denominator is positive. Here again, the distributional 
inequality holds jointly for 1 < i < n. 

For 1 < i < § , we may replace the n — I + 1 by n and the n — i + 1 by § 
on the right-hand side of (2.10) and divide by n to get 



(2.11) \Z rM < 4 ^v^g^ + lQ(l), 

again provided that the denominator on the right-hand side is positive. 
By (2.5) and (2.6), 

1 1' 

.Si i. 

jointly for any finite number of i's. Therefore, 

j , / J 



Z n (i) -i c 



max 

N<j<N+m 



y^Z n (i)-^c max ( Yj — r\ 



for every N,m> 1 . We wish to have a version of this statement that corre- 
sponds to m = oo; for this, we need some domination. This is provided by 
(2.11) as we now argue. 

Note first that the denominator of the first term on the right-hand side 
of (2.11) is positive for sufficiently large i (depending on the value of 5). 
Therefore, since 



^ y/ilog(i + l) 

E ~2 <0 °> 



i=l 



it follows that there is a function /(s) so that i > f(s) implies % — 4sy / ilog(i + 1) > 
and 



i>f(s) + 
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Combining this with (2.11), it follows that there is a constant K [that can 
be taken to be the sum of the constant in the 0(1) term in (2.11) and C] so 
that 

P(N + m>f(S))<p( Yl \Z n (i)\<K). 

\N+rn<i<n/2 I 

The final ingredient is the inequality 

3 3 

max > (i) < max > Z n (i) 
N<j<N+m ^ n ^ i - N<j<n/2 ^ i 

t=i — 1 i=i 

3 

N<T<N+m * 

«=1 N+m<i<n/2 

for N + m < Combining these observations, the conclusion is that for any 
N> 1, 

(2.12) max VZ B (i) 4 csup f 1 - - )) + O(l). 

Next, we consider terms that correspond to n — n e < i < n, where < 6 < 
1. For such an i, the right-hand side of (2.8) is at most 

H<f^))/(? J ^£° J ^) +0(1> ' 

where |_"J is the greatest integer function. Since 

n y 

\ - A; 

has mean that is asymptotic to logn and variance that is bounded in n, 

t-^l n _ 7 _l_ 1 



log n n—l + 1 

in probability. Similiarly, 

1 L "^f J 



y — ^ — a 

log n n — I + 1 

in probability. Therefore (2.13) is asymptotic to 

2C 

{l-9) 2 \ogn 
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It follows that 
(2-14) £ 



n—n e <i<n 



n n > > *— ' 



l=i 



n-l + 1 



O(l). 



Finally, we consider the intermediate terms ^ <i <n — n s . In this case, 
1 



E 

Z=l 



n - Z + 1 



n + 1 2n + 2 

> lOg ; > lOg ■ 



n — i + 1 



n + 2 



and n — i + 1 > n e , so that the right-hand side of (2.10) is at most 

2n + 2 



(2C5Vnlog(n+l)) / (\n e log 



n + 2 



(2.15) 

provided that 
(2.16) 



2SVnlog(n + 1) ) log 



2n + 2 
n + 2 



+ 0(1), 



2 n -L 2 

n e log > 25v / nlog(n + 1). 

n + 2 v 7 



If 2 < # < 1) (2-16) holds eventually a.s. and then (2.15) tends to zero as 
n — > oo . It follows that 

(2-17) £ 

n/2<i<n— rfi 



n n 1 1 



2=1 



n- Z + 1 



0(1) 



in probability. Combining (2.12), (2.14) and (2.17), we see that for every 
N> 1, 



(2.18) 



max > 

N<j<n^ 



M n ii) 1 



I? 



I? 



1 



-In 



. (=1 



n-Z + 1 



c sup | 



,i=l 



as required. □ 



To prepare for the second lemma, define 



7_ 

i=l 



7(j»=Eff E 



1 



. z=i 



n-Z + 1 



+ E 



5 mm j, 



=1 



E 

. Z=l 



n - / + 1 



ni?min(j, D). 
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Lemma 2. For every N > 1, 
j 



1 

max — 

N<j<n n 



£M„(i)-j(i-l)- J2 min(i,M n ,(i))- 7 (j» 

i=l i=j'+l 



p(E(4-7])+°( 1 )- 



i=l 



Proof. Write the right-hand side of (2.1) as 

n j 

(2.19) j(j - 1) + J2 min 0'> A) - £ min(i, M n (i)). 

i=i j=i 

The last term is easy to deal with. Since | min(j, u) — min(j, v)\ < \u — v\ , the 
estimates in (2.8), (2.11), (2.14) and (2.17) apply just as well when M n (i) 
is replaced by min(j, M n (i)) and 

) ) is replaced by glminlj, 



.9 

Now, however, (2.6) is replaced by 



/ min(j,M ra (l)) min(j,M w (2)) mm(j,M n (j)) \ d Q 
V n n I 

Therefore, (2.18) is replaced by 

j 

(2.20) max V 



i=l 



min(j,M n (i)) 1 ( ■ ( ■ (sr 1 > \ ^ 
g I mini j, }^ 



n n \ \ yjz^n — I + 1 



io(i) 



To handle the first two terms in (2.19), we need the asymptotics for mo- 
ments of min(j, D) that follow from nP(D > n) — > c: 



Emm(j,D) = £ P(D > I) ~ clog j, 
(=1 

E[mm(j, D)] 2 = j^(2l - 1)P(D > I) ~ 2cj, 
S[min(i,£»)] 3 = ]T(3Z 2 - 3/ + 1)P(L> > Z) ~ fcj 2 , 



j 



E[uan(j, D)f = £(4Z 3 - 6l 2 +41- 1)P(D > I) ~ fcj 



3 
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E 



- ^[min(j, Di) - Emin(j, D)} 



i=i 



Now take \ < < §. Then 



E max 

l<j<n 6 



-1 1 



[min( j, Di)-E min( j, D)] 



so that 
(2.21) 



max 

l<j<n" 



<o E 

l<j<n e 



0. 



- V[min(j, Di) - Emm(j,D)\ 



i=l 



On the other hand, 



mm — 

n e <j<n U 



i=l 



n e (n e - 1) , , 

> — - £min(n,.D), 



n 



which tends to oo. Combining this with (2.6), (2.11), (2.14), (2.17), (2.20) and (2.21) 
we conclude that for N > 1 , 



1 

max — 

N<j<n n 



J_ 

i=l 



(2.22) 



Y,M n (i)-j(j-l)- £ min(i,M„(i))- 7 (j, 
i=j+l 



n 



c sup ( V 



1 1 

Si i 



+ 0(1), 



as required. □ 

Finally, we need to bound the n)'s below. 
Lemma 3. There exists 



(2.23) 



inf — min 7(j,n) > — oo. 
n>l n l<j<n 



Proof. Since we need only a lower bound and g is nonnegative, we 
may neglect the middle term in the definition of j(j,n). So, using (1.4), it 
sufficient to show that 
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is bounded below. 
To do so, note that 



1=1 



n-l + 1 



n — i 



so that 



-In 



1=1 



n-l + 1 



clog 1 + 



> 



»(i+l)/n 
li/n 

Hence, it suffices to prove that 



1 



log(l - x) 



dx. 



1 



log(l — x) 



dx < oo. 



The only place where this integrability is an issue is at x = 0. Making a 
change of variables, we see that it suffices to check 

\g(y) ~ cy\ 

— 2 dy < oo. 



So, it is enough to check that 



ra Hn+i) \n-cy\ , 

/ — 2 — dy<°° 



or that 



^max(|n — eg 1 (n)|,|n — c<7 1 (n + l)|)((? 1 (n+l)— g 1 (n)) 

V <0 °- 

This follows from (1.3), (1.4) and (2.3). □ 



We are now in a position to complete the proof of part (c) of the theorem 
using the three lemmas. Combining (2.22) and (2.23), we see that there is a 
constant K so that 



/ 1 
lim inf P max — 

rwoo \N<j<n n 



J_ 
i=l 



]TM n (i)-j(j-l)- E min(i,M re (0) 
i=j+i 



< 



>P(csup( ^ 



Si i 



< —K . 
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Note that the distribution of 

yV 1 1 

j>N i=1 \Ji 1 

has support 

L SH 

Therefore N can be chosen large enough that the probability on the right- 
hand side above is strictly positive. By (2.6), for each j, 



j 



' j i ~ p{D - i) 



i=l 



J2 M n(i)~ E min(j,Af n (i)) 

.i=l i=j+l 

which has positive probability of being negative. Since all these statements 
hold jointly in j, it follows that liminf n P{A n ) > 0, where 

A n = \j2 M nii) < j(j - 1) + E min C?'< M ^)) for a11 1 < 3 < n\. 

{ i=l i=j+l ) 

On the other hand, 

P{K) < P(M n (l) < n - 1) - e" c < 1. 

It follows that all limit points of P(A n ) lie in (0, 1). 

To complete the proof of part (c), we need to show that A n and the event 

{D\ + • • • + D n is even} 

are asymptotically independent. To do so, fix m and let Q be the cr-algebra 
generated by {D^, k > m}, let A be any event, let B be any event depending 
on {D\ , . . . , D m } and let C be any event in Q. Then 

P{AC\B^C)-\P{AC\C) 

= E[t A - E{A\Q),B n C] - \E[t A - E(A\G),C} 
+ P(AnC)[P(B)-%\. 

Therefore, 

\p(A nBnC)- \P{A n c)\ < 2E\i A - E(A\g)\ + \p(b) - 1|. 

Applying this inequality to B = {D\ -\ h D m is even}, C = {D m+ i H h 

D n is even} and their complements, we have the inequality 

\P(A, Di + • • • + D n is even) - \P{A)\ 

= \P(A n b n C) + P(A nB c n c c ) - \p{A n G) - \p{A n c c )\ 

< AE\t A - E(A\G)\ + \2P(D 1 + --- + D m is even) - 1|. 
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Applying this to the A n 's above and using the Proposition, we see that to 
prove A C (0, \), it suffices to prove 

lim E\± An -P(A n \g)\=0. 

n — >oo 

To do this, let D\, . . . , D m be independent, independent of {A, D2, ■ ■ ■ } and 
have the same distribution as A . Quantities defined in terms of the sequence 

{Di, . . .,D m ,D m+ i,. . . } 

are denoted by the earlier notation with a tilde. Then it suffices to prove 
that 

lim P(A n AA n ) = 0, 

n— *oo 

where A denotes the symmetric difference. To check that this is enough, 
note that since A n and A n are conditionally independent given Q, we have 

[E\l An - P(A n \G)\f < E[l An - P(A n \G)} 2 

= ±E[l An -P(A n \g)-lj +P{A n \Q)} 2 



<E[l An -lj ] 2 <2P(A n AA n ). 

Unfortunately, it is not too easy to see that the right-hand side above 
tends to zero as required. However, we can apply these same considerations 
to certain events that are larger (resp. smaller) than these A^s to complete 
the proof. For the larger ones, simply use A n = |M n (l) < n — 1}. With this 
choice, it is clear that P(A n AA n ) — > 0. For the smaller ones, use A n = {W n < 
—e} for e small and positive, where 



1 

max — 

l<j<n n 



Y^M n {i)-j{j-l)- £ min(j,M n (z)) 

.i=l 



Then W n — W n — > in probability, since 

n m 

£|M„(i)-M n (i)|<£|A-A|. 

i=l i=l 

Therefore, passing to subsequences so that the distributional limit of W n 
[which might have some mass at —00 but has none at +00 by (2.22) and 
(2.23)], we see that P(A n AA n ) — > provided that this distributional limit 
has no mass at e. Now choose such an e. 



PROOF of part (d). By (2.1) and the Proposition, it suffices to show 
that 



(2.24) 



max 

l<j<n 



2 J2 M n (i) ~ j(j - 1) - E min C?> °i 



i=l 



i=l 



-CXD 
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in probability. We consider various ranges of j's separately. Take ^ < 9 < 1 
and consider first the range n e < j < n. Using 

3 n 

i=l i=l 

we see that 

1 j 

(2.25) max VM n (j)->0 a.s. 

—j— l= i 

by the law of large numbers if ED < oo. If we assume ED = oo and (1.5) 
instead, then (2.25) follows from Theorem 8.9 on page 68 of Durrett (1996), 
since 

Y,P(D>n 29 ) <oo. 

n 

Using the fact that j(j — 1) > n d (n e — 1) for n in this range, it follows under 
either assumption that 



(2.26) max 

n s <j<n 



2j2M n (i)-j(j-l) 



i=l 



-oo. 



To handle the other values of j, use (2.4) to write 

(2,7) X>„ (i )lX>(f)lX>(^), 
where 

5 = inf^. 

i>l I 

Note that S > a.s. by the law of large numbers. 

Assume now that ED < oo. By the monotonicity of g, 



n \ibj J(i-i)/n \xb; 



Therefore, by (2.27), 



i ^ d r 

(2.28) - V M n (i) < \ 

n . ~t a Jo 



1 

xS 



dx. 



l<i<n e 

The right-hand side of this expression tends to as n — > oo provided that 

g{l/x) dx < oo, 
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or, equivalently, that 



9(x) 



dx < oo. 



i- 



Since g(x) = n for g (n) < x < g 1 (n + 1), this is equivalent to 

, g ->+i) „ r i l 

> / -jr dx = } n — 



g 1 (n) g 1 (n + l) 



< oo. 



This follows by summation by parts from ED < oo and (2.3). So, the left- 
hand side of (2.28) tends to zero and hence 



max 

l<j<n 6 



2j2M n (i) 



n 



i=l 



-oo. 



Combining this with (2.26) gives (2.24) when ED < oo. 

Assume now that ED = oo and (1.5) holds. By (1.5) and (2.3), there is a 
constant C so that 

Cx 

g[x)<- ; it, X > 0. 



log (a; + 1) ' 



Therefore, by (2.27) 

1 

n 



E M n (i)<c E 

l<i<logn l<i<logn 



z51og 



n 

Is 



+ 1 



0. 



It follows that 
(2.29) 



max 

l<j'<logn 



J 

I 

i=l 



n 



-oo. 



log 



n 



S 



+ 1 



-l 



1 



E 7 

Ki<n e 



The same argument shows that 
- E Mn(i)<C 

ft 

l<i<n e 

which remains bounded as n — > oo. However, since ED = oo, 

1 

n 

by the law of large numbers. Therefore 

j 



min(log n, Dj) — » oo a.s. 



i=i 



max 

log n<j<n e 



2^M n (i)-^min(i,A) 
i=l 



i=l 



-oo. 



Combining this with (2.26) and (2.29) gives (2.24) as required. □ 
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